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(54) Title: COMPOSITIONS AND METHODS FOR GENERATING EXPRESSION VECTORS THROUGH SITE-SPECIFIC 
RECOMBINATION 

(57) Abstract: Compositions, kits, and methods are provided for use in a recombinationa] cloning or subcloning methods for con- 
structing expression vectors which comprise: ligating a libraiy of double-stranded linear donor DNAs, where each member of the 
library includes a donor DNA sequence, with a double-stranded linear driver DNA which includes a promoter sequence and a donor 
recombination site to form a single circular donor DNA, the single circular donor DNA not including an origin of replication, where 
the donor DNA sequence is under the transcriptional control of the promoter, and contacting the circular donor DNA and a circular 
acceptor vector in the presence of a recombinase to form a single fiised circular vector, the circular acceptor vector comprising an 
origin of replication and an acceptor recombination site capable of recombining with the circular donor DNA. 
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COMPOSITIONS AND METHODS FOR GENERATING EXPRESSION 
VECTORS THROUGH SITE-SPECIFIC RECOMBINATION 

5 

Field of tha Invypfffffi 

Thfe Invention tBiafes to recombinant DNA technology, nucleic 
acids, vectors and methods for use In a recomblnatlonal cloning or 
subcloning, and more specifically for constructing expression vectors by 
1 0 using recombination proteins In vitro or In vivo through site-specific 
recombination. 

Description of Related Art 

Recombinant DNA technology, also called gene cloning or 

1 5 molecular cloning, is widely used to transfier genetic Infbmnatlon, I.e. 
DNA, from one organism to another. A typical recombinant DNA 
experiment often follows the following procedure. First, the DNA (e.g., 
the cloned DNA, insert DNA, target DNA, or foreign DNA) from a donor 
organism Is extracted, enzymaticaily cleaved (or cut/digested), and 

20 joined (ilgated) to another DNA entity (e.g. a cloning vector) to form a 
new, recombinant DNA molecule (or cloning vector-insert DNA 
construct). Second, this cloning vector-insert DNA construct is 
transfierred Into and maintained within a host cell, such as 
transfonnation of a bacterial host cell by the construct. Third, those host 

25 cells that take up the DNA constnict (transfonmed ceils) are identified 
and selected from those that do not. In addition, if required, a DNA 
construct can be prepared to ensure that the protein product that Is 
encoded by the cloned DNA sequence is produced by the host cell. 

30 Accordingly, this traditional cloning methods using restriction 

enzymes and ligase can be time consuming, especially when a specific 
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expression vector is required for transfsrring tlie target gene into a 
heterologous iiost cell, such as a mammalian cell. The specific 
expression vector may not contain matching restriction sites for the 
donor DNA. Extensive reenglneering of the expression vector may be 
5 required to introduce the matching restriction sites into the vector so that 
the vector and the insert DNA can be iigated to produce the final 
construct AHematively, multiple restriction enzymes may have to be 
employed to generate an insert DNA having suitable restriction sites for 
ligation with the vector. In this case, reaction conditions for each 

1 0 restriction enzyme may differ such that it is often necessary to perform a 
few separate restriction digestion reacttons to obtain the desired insert. 
Further, the efficiency of direct ligation between the vector and insert 
may be very low, especially between large fragments. As a result, the 
whole procedure is tedious, and the final yield of the correctly iigated 

1 5 construct can be low. 



Site-specific recombination represents another useful method of 
recombinant DNA technology. This method employs a site-specific 
recomblnase, an enzyme which catalyzes the ^change of DNA 

20 segments at specific recombination sites. Site-specific recombinases 
present in some viruses and bacteria, and have been characterized to 
have both endonuciease and ligase properties. These recombinases, 
along with associated proteins in some cases, recognize specific 
sequences of t>ase8 in DNA and exchange the DNA segments fianlcing 

25 those segmente. Landy, A. (1 993) Cunent Opinion in Biotechnology 
3.699-707. 



A typical site-specific recomblnase is Cre recomblnase. Ore is a 
38-i(Da product of the cre (cyciization recombination) gene of 
30 bacteriophage P1 and is a site-specific DNA recomblnase of the Int 
family. Sternberg. N. etai. (1986) J. Mol. Biol. 187: 197-212. Cre 
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recognizes a 34-bp site on the P1 genome called loxP (locus of X-over 
of P1) and efficiently catalyzes reclpnDcal conservative DMA 
recombination between pairs of loxP sites. The loxP site consists of two 
13-bp inverted repeats flanking an 8-bp nonpalindromic core region. 
5 Cre-mediated recombination between two directly repeated loxP sites 
results in excision of DMA between them as a covalently closed circle. 
Cre-mediated recombination between pairs of loxP sites in inverted 
orientation will result in inversion of the inten/enihg DMA rather than 
excision. Breaking and joining of DMA is confined to discrete positions 
1 0 within the core region and proceeds on strand at a time by way of 

transient phophotyrosine DNA-proteIn linkage wWh the enzyme. Other 
examples of site-specific recombination systems include the 
integrase/att system form bacteriophage X, and the FLP/FRT system 
from the Sacchammyces ceravlsiae 2pi circle plasmid. 

16 

These site-specific recombination systems have been used In 
vivo to facilitate recombination between different vectors. Waterhouse 
at ai. used an In vivo method to Join light and heavy chains of an 
antibody. The light and heavy chains were cloned in different phage 

20 vectors between loxP and loxP 51 1 sites that were used to transfonn 
new E. CO// cells. Waterhouse, P. et al. (1993) Nucleic Acid Res. 
21:2265-2266. Ore acted on two parental molecules, one plasmid and 
another phage, in the host ceils to produce four products in equilibrium: 
two different cointegrates (produced by recombination at either ioxP or 

25 ioxP51 1 sites), and two daughter molecules, one of which was the 
desired product. Schiake and Bode used an In vhro method to 
exchange expression cassettes at defined chromosomal locations, each 
flanked by a wild fype and spacer-mutated FRT recombination site. 
Schiake and Bode (1994) Biochemistry 33:12746-12751. A double- 

30 reciprocal crossover was mediated in cultured mammalian cells by using 
the FLP/FRT system for site-specific recombination. Aoki et al. used a 
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Shuttle plasmid (pAdMCS) that carried a gene of interest, a loxP site, the 
adenoviral 5-LTR and packaging signal 0 to 1 mu, and a multiple cloning 
site. Aoki et al. (1999) Mol. Med. 5:224-231 The shuttle plasmid was 
linearized by a restriction enzyme Nhel and recomblned with Clal- 
5 digested adenoviral cosmid in vitro. Cre recombinase produced the full- 
length recombinant adenoviral vector In vitro by an exchange of region 
distal to the loxP she linearized in these two molecules. 

10 SUMMARY OF THE INVENTION 

The present invention relates to compositions, kits, and methods 
for use in a recombinational cloning or subcioning. In particular, the 
present invention provides novel methods for constructing expression 
15 vectors by using site-specific recombinases in vitro. These method may 
be used for high throughput screening of genes, functional genomics 
and other human genome projects. 

In one aspect, the present invention provides a double-stranded 
20 drcular donor DNA for transfening a donor DNA sequence into 

expression vectors. The circular donor DNA comprises: a donor DNA 
sequence; a donor recombination site; at least one selectable marker, 
the circular donor DNA not including an origin of replication. 

25 The donor DNA sequence may be any gene of interest or any 

synthetic DNA sequence which Is needed to be transfened Into an 
expression vector. For example the donor DNA segment may be a 
sequent derived from cDNA of a particular gene or one of the • 
members of a cDNA library. The donor DNA may also be a genomic 

30 DNA that contains the coding region intenupted with non-coding 
sequences. 
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In another aspect, the present invention also provides a library of 
double-stranded circular donor DMAs that may be used for high 
throughput screening. The library of double-stranded circular DIVIA 
6 comprises: a donor DNA sequence which varies within a library of donor 
DNA sequences; a donor recombination site; and at least one selectable 
marker, the circular donor DNA not including an origin of replication. 

The library of donor DNA sequences may be a library of cDNA or 
1 0 genomic DNA derived from any desirable sources. For example, the 
library of donor DNA sequences may be a cDNA library from single 
human chromosomes. 

The circular donor DNA may further comprise a promoter 
1 5 sequence that controls expression of the donor DNA sequence. The 
promoter may be any anray of DNA sequences that interact specifically 
with cellular transcription Motors to regulate transcription of the 
downstream gene. The promoter may be derived from any organism, 
such as bacteria, yeast, insect and mammalian cells and viruses. 
20 Examples of the promoter include, but are not limited to, E. goH lao and 
trp operons, the tac promoter, the bacteriophage ^ p*- promoter, 
bacteriophage T7 and SP6 promoters, jJ-actin promoter, insulin 
promoter, human cytomegalovims (CMV) promoter, HIV-LTR (HiV-iong 
tenninal repeat), Rous sarcoma virus RSV-LTR, simian virus SV40 
25 promoter, baculoviiai polyhedrin and plO promoter. 

The promoter may also be an indudble promoter that regulates 
the expression of downstream gene in a controlled manner. Examples 
of inducible promoters include, but are not limited to, the bacterial dual 
30 promoter (activator/represser expression system) which regulates gene 
expression in mammalian cells under the control of tetracycline and its 
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10 



analogs and promoters that regulate gene expression under the control 
of lactors such as heat shocks, steroid honnones, heavy metals, phorbol 
ester, the adenovirus E1A element, interferon, or semm. 

The donor recombination site may be any segment or arrays of 
DNA sequence recognized by a site-specific recombinase which 
catalyzes site-specific fusion between the circular donor DNA and an 
acceptor vector. The site-specific recombinase may be a recombinase, 
a transposase or an integrases. 



In one variation, the recombination site is a lox site that is 
recognized by the'Cre recombinase of bacteriophage PI. Example of lox 
site includes, but are not limited to, loxB, loxL, loxR, loxP [SEQ ID 
N0:1], loxP3, loxP23, loxA86. loxAl17, ioxP511 [SEQ ID N0:2], and 
15 loxC2 [SEQ ID N0:3]. 



In another variation, the recombination site is a recombination 
site that is recognized by a recombinases other than Ore. Examples of 
the non-Cre recombinases include, but are not limited to, site-specific 

20 recombinases include: att sites recognized by the Int recombinase of 
bacteriophage X (e.g. attl. att2, attS, attP, attB, attL, and attR), the FRT 
sites recognized by FLP recombinase of the 2pi plasmid of 
Sacchammyces cerevisiae, the recombination sites recognized by the 
resolvase family, and the recombination site recognized by transposase 

25 of Badllus Viruingiensls. 

The example of site-specific recombinase include, but are not 
limited to, bacteriophage PI Cre recombinase, yeast FLP recombinase, 
Inti integrase, bacteriophage K phi 80. P22, P2, 186, and P4 
30 recombinase, Tn3 resolvase, the Hin recombinase, and the CIn 

recombinase, £. co//xerC and xerD recombinases, Bacillus thuiingtensis 
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recombinase, Tpnl and the p-lactamase transposons, and the 
immunoglobulin recombinases, 

The selectable marl<er of the circular donor DMA may be any 
5 functional element for facilitating subsequent identification and selection 
of clones of the recombination product under suitable conditions. The 
selectable maricer may encode any functional element, such as protein, 
peptide, RNA, binding site for RNA and proteins, or products that 
provide resistance to organic or inorganic agents. Examples of 

10 selectable markers include, but are not limited to, reporter genes such 
as D-galactosidase (GAL), fluorescent proteins (e.g., GFP, GFP-UV, 
EFFP, BFP, EBFP, ECFP, EYFP), secreted forni of human placental 
alkaline phosphatase (SEAP), p-giucuronidase (GUS)); resistance 
genes against antibiotics (e.g. neomycin (G418) or hygromycin resistant 

15 gene, puromycin resistant gene), yeast seletable markers Ieu2-d and 
URA3, apoptosis resistant genes (e.g. bacuioviral p35 gene), and 
antisenoligonucleotides. 

The circular donor DMA may optionally include an affinity tag for 
20 selection and isolation of protein product encoded by the donor DMA 
segment. Examples of such an affinity tag include, but are not limited 
to; a polyhistldlne tract, polyarginine, glutathione-S- transferase (GST), 
maltose binding protein (MBP), a portion of staphylococcal protein A 
(SPA), and various immunoaffinity tags (e.g. protein A) and epitope tags 
25 such as those recognized by the EE (Glu-Glu) antipeptide antibodies. 
The affinity tag may be positioned at either the amino- or carboxy- 
temilnus of the donor DNA. 

The present Invention also provides a circular acceptor vector for 
30 generating recombinant expression vector. The vector comprises an 
origin of replication; and an acceptor recombination site capable of 
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recomblning with a donor DMA. Optionally, the acceptor vector may not 
include a promoter for regulating expression of the donor DMA. 

The circular acceptor vector may be any vector that can 
5 transform, transfect or transduce a host cell. The acceptor vector may 
be a plasmid, a phage or a viral vector as long as it is able to replicate in 
vitro or in a host celt, or to convey the donor DNA to a desired location 
within a host cell. Examples of host cells include, but are not limited to, 
bacterial (e.g. E coli, Bacillus subtllis, etc.), yeast, animal, plant and 
10 insect cells. 

In one variation, the circular acceptor vector may be a prokaryotic 
plasmid. Optionally, the acceptor vector may comprise a prokaryotic 
termination sequence. Examples of the prokaryotic tennination 
15 sequence include, but are not limited to, the T7 tennination sequence, 
the Tint, Tu. Tia, Tu, Tl^, TRg, Tes temiination signals derived from the 
bacteriophage X. 

In another variation, the circular acceptor vector may be a 
20 mammalian expression vector. The mammalian expression vector 

contains one or more eukaryotic marker genes, appropriate eukaryotic 
transcriptional and translatlonal tennination signals and a sequence that 
signals polyadenyiation of the transcript messenger RNA (mRNA), and 
an origin of replication that functions in a mammalian host cell. 
25 Examples of the eukaryotic polyadenyiation sequence include, but are 
not limited to, the Herpes simplex virus thymkline kinase 
polyadenyiation sequence, the bovine growth hormone polyadenyiation 
sequence, and the simian virus 40 polyadenyiation sequence. 

30 Optionally, the eukaryotic expression vector may also cany an 

origin of replication and selectable marker genes that function In 
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bacterial cells, forming a shuttle vector. 

In yet another variation the circular acceptor Includes a promoter 
for regulating expression of the donor DNA sequence canled by a 
6 circular donor DNA of the invention. According to this variation, the 
recombination site may be placed downstream of the promoter and the 
transcription initiation site in the acceptor vector. 

In yet another variation, the circular acceptor may be a yeast 
10 expression vector such as a S. cemvl^ae expression vector. Various 
types of S. CG/Bv/s/ae expression vector include, but are not limited to, 
episomai or plasmid vector, integrating vectors, and yeast chromosomes 
(YACs). 

15 In yet another variation, the circular acceptor vector may be a 

baculovlrus DNA, such as wild type or mutant genomes o1 Autographa 
califbmica multipla nuclear polyhedrosis virus (Acl\4NPV) virus. 

Optionally, a baculovlral acceptor vector according to the present 
20 Invention may not contain a polyhedrin promoter. Instead, the 

polyhedrin or the baculovlral p10 promoter can be positioned upstream 
of the donor DNA sequence of the circular donor DNA of the present 
invention. 

25 The present invention also provides Idts for generating 

recombinant vectors. In one embodiment, the kK comprises: a double- 
stranded circular donor DNA comprising a donor DNA sequence, a 
donor recombination site, and at least one selectable marker, the 
circular donor DNA not including an origin of replication; and a circular 

30 acceptor vector comprising an origin of replication and an acceptor 

recombination site capable of recombining with the circular donor DNA. 
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In another embodiment, the kit comprises: a library of double- 
stranded circular donor DNA comprising a donor DNA sequence which 
varies within a library of donor DNA sequences, a donor recombination 
5 site, and at least one selectable marker, the circular donor DNA not 
including an origin of replication; and a circular acceptor vector 
comprising an origin of replication and an acceptor recombination site 
capable of recomblning with the circular donor DNA. 

10 In yet another embodiment, the kit comprises: one or more linear 

donor DNA comprising a donor DNA sequence; a linear driver DI>1A 
comprising a promoter sequence, a recombination site, and at least one 
selectable marker, ligation of the linear donor DNA and the linear driver 
DNA resulting In a circular donor DNA; and a circular acceptor vector 

15 comprising an origin of replication and an acceptor recombination site 
capable of recomblning with the circular donor DNA. 

Hie present Invention also provides a method for generating 
recombinant expression vector in vitro through site-specific 

20 recombination between a circular donor DNA and circular acceptor 
DNA, each containing recombination site recognized by the 
recombinase. The method comprises: contacting a circular double- 
stranded donor DNA and a circular acceptor vector in the presence of a 
recombinase under conditions suitable for the circular double-stranded 

25 donor DNA and circular acceptor vector to recombine to form a single 
fused circular vector. In this method, the circular double-stranded donor 
DNA comprises a donor DNA sequence, a donor recombination site, 
and at least one selectable marker, but not including an origin of 
replication. The circular acceptor vector comprises an origin of 

30 replication and an acceptor recombination site capable of recomblning 
with the circular donor DNA. The promoter for regulating expression of 
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the donor DMA may be contained in either the donor DNA or acceptor 
vector. 

According to this method, the circular donor DNA containing a 
6 site-specific recombination site may be recomblned with a circular 

acceptor vector In the presence of Cre recombinase. The fBcombinatlon 
sites on the circular donor DNA and the circular acceptor vector may 
each contain a lox site. 

1 0 The method may further Include steps of transforming, 

transfecting or transducing a host cell and selecting the con^ctly fused 
recombinant vector based on the selectable phenotype conferred by the 
selectable marker gene on the recombinant vector. 

15 The present invention also provides a method for generating 

recombinant expression vectors from linear DI^IA segments in vitro. The 
method comprises: llgating one or more double-stranded linear donor 
DNA which includes a donor DNA sequence with a double-stranded 
linear driver DNA which Includes a promoter sequence and a donor 

20 recombination site to fonn a single circular donor DNA, the singular 
circular donor DNA not including an origin of replication, where the 
donor DNA sequence is under the transcriptional control of the 
promoter; and contacting the circular donor DNA and a circular acceptor 
acceptor vector in the presence of a recombinase to fbnn a single fused 

25 circular vector. In this method, the circular acceptor vector comprises 
an origin of replication and an acceptor recombination site capable of 
recombining with the circular donor DNA. 

According to this method, the linear donor DNA and linear driver 
30 DNA may contain matching restriction sites or other type of annealing 
sites so as to be ligated to form a circulaized DNA. The linear donor 
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and driver DMAs may be derived from PGR amplification products. 

The present Invention also provides a method for high throughput 
production of recombinant expression vectors from linear Dl^ 
5 segments in vitro. The method comprises: llgating a library of double- 
stranded linear donor DMAs, where each member of the library Includes 
a donor DNA sequence, with a double-stranded linear driver DMA which 
Includes a promoter sequence and a donor recombination site to fbmi a 
single circular donor Dl^, the singular circular donor DNA not Including 

10 an origin of replication, where the donor DNA sequence is under the 

transcriptional control of the promoter; and contacting the circular donor 
DNA and a circular acceptor acceptor vector in the presence of a 
recombinase to form a single fused circular vector, in this method, the 
circular acceptor vector comprises an origin of replication and an 

1 5 acceptor recombination site capable of recomblning with the circular 
donor DNA. 

According to this method, the library of double-stranded linear 
donor DNAs may be DNAs amplified firom a library of cDNA clones. The 
20 library of cDNA clones may be arrayed In a multi-well plate suc^ as 96- 
and 384-well plates. The library of cDNA clones may be a cosmid or 
phage library. 

Also according to the method, ligating the library of double- 
25 stranded linear donor DNAs with a double-stranded linear driver DNA 
may be peribmned by Ligation Independent Cloning (LIC). Alternatively, 
ligating the library of double-stranded linear donor DNAs with a double- 
stranded linear driver DNA may be perfonned in the presence of T4 
DNA llgase. 

30 

The method may further include a step of transfemhg the 
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recombinant expression vector Into a host and Isolating the protein 
expressed from the vector by affinity tagging. The affinity tagging may 
be based on a polyhlstidine tag (e.g. HiSe), a protein tag (e.g., GST, 
maltose binding protein) or an epitope tag (e.g. an EE ag). 

5 

The methods of the present invention allow rapid and efficient 
generation of expression vectors containing the gene of interest without 
bacterial cloning. Direct ligation of linear donor DMA and linear driver 
DMA to generate a circular donor DNA allows for efficient cloning of 
1 0 donor DNA such as a cDNA library into an expression vector in an 

automated and high throughput manner. The methods can be used in a 
wide variety of high throughput arrays for functional genomics, protein 
genomics (proteomics), and other human genome projects. 

16 BRIEF DESCRIPTION OF FIGURES 

Figure 1 illustrates a process of constaicting a library of 
baculoviral expression vectors through Cre-mediated site-specific 
recombination. 

20 

Figure 2 illustrates a process of constructing a baculoviral 
expression vector for the GUS gene through Cre-mediated site-specific 
recombination. 

25 

PETAIIPP Dg$CRtPT!QN Of THg INVENTIQN 

The present invention provides reagents, kits and methods for 
use In a recomblnational cloning or subcloning process, and, in 
30 particular, for constructing expression vectors by using a site-specific 
recombinase in vitro or in vivo. In one aspect, a method is used to 
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directly fuse a linear segment of donor DNA (e.g., gene of Interest) with 
another linear segment of DNA comprising various functional elements 
such as promoters, selectable mariners and a recombination site, 
resulting In a single circular donor DNA. This circular donor DNA Is then 
5 recombined into a circular acceptor vector which also contains a 

recombination site through site-specific recombination catalyzed by a 
reoombinase. The recombination product can be used to transform, 
transact or transduce various types of host cells, depending on the 
specific type of acceptor vector used. 

10 

The circular donor DNA need not carry an origin of replication for 
propagation In host cells such as bacterial cells. Instead, the circular 
donor DNA may be produced from directly llgating two or more linear 
segments of DNA which may be amplified by polymerase chain reaction 

1 5 (PCR). Such a separation and ligation of difiierent segments of DNA 
allows flexible distribution of different elements among the linear 
segments. For example, one linear segment may contain the gene of 
interest amplified firom a cDNA library, while the other linear segment 
contains functionai elements essential for subsequent recombination in 

20 vitro (or in vivo) and expression In host cells. By using a site-specific 
recomblnase, such as Ore reoombinase, this circular DNA can be 
recombined into any gene-transfemng vector without using restriction 
enzymes as long as the vector cames a recombination site recognized 
by the reoombinase. Further, direct ligation of linear segments of DNA 

25 avoids laborious steps of bacterial cloning and fecllltates high 

throughput screening of large library of genetic materials, such as cDNA 
libraries derived from diseased tissues or cells. In addition, the circular 
DNA produced by direct ligation of these segments can be Itee of other 
undesirable genetic materials such as "junk DNA" derived from a 

30 bacterial plasmid that may affect expression, viability or stability of the 
recombinant vector. 
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10 



1 . Circular Donor DNA 

The present Invention provides a double-stranded circular donor 
DNA for transferring a donor DNA sequence Into expression vectors. 
The circular donor DNA comprises: a donor DNA sequence; a donor 
recombination site; at least one selectable marker, the circular donor 
DNA not Including an origin of replication. The donor DNA sequence 
may be aiiy gene of Interest which is needed to be transfened into an 
expression vector. 



The present Invention also provides a library of double-stranded 
circular donor DNAs which comprises: a donor DNA sequence which 
varies within a library of donor DNA sequences; a donor recombination 
site; and at least one selectable mariner, the circular donor DNA not 
15 Including an origin of replication. 



The circular donor DNA contains a donor DNA segment (either 
cDNA or genomic DNA), a promoter (e.g. SV40 early gene enhancer), a 
selectable maricer (e.g. Neo gene), and a sequence-specific 

20 recombinase target site (e.g. a loxP site). The promoter controls 

expression of the gene of interest and the selectable marker gene when 
the circular donor DNA is recombined with an acceptor vector and the 
resulting recombinant vector is Introduced Into a host cell. The circular 
donor DNA may further contain a poiyadenylation signal for expression 

25 in mammalian cells. 



The donor DNA sequence may be any deoxyrlbonucleotide 
sequence encoding a functional gene or any synthetically generated 
DNA sequence. For example the donor DNA segment may be a 
30 sequence derived frorn cDNA of a particular gene or one of the 

members of a cDNA library. The cDNA library may be produced by 
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converting mRNAs In a sample Into double-stranded complementary 
DMA (cDNA) by using reverse transcriptase (RT) and the Klenow 
fragment of nucleic acid polymerase I. Depending on the source of 
mRNA sample, the cDNA library may contain various populations of 
6 genes of interest, such as disease genes located in certain tissue or 

type of cells. The donor DMA may also be a genomic DMA that contains 
the coding region Intenxipted with non-coding sequences 
(Introns/intervening sequences). These Introns may contain regulatory 
elements such as enhancers. 

10 

The circular donor DMA may further comprises a promoter 
sequence that controls expression of the donor DMA sequence. The 
promoter may be any aray of DNA sequences that Interact specifically 
with cellular transcription Actors to regulate transcription of the 

16 downstream gene. The promoter may be derived ft-om any organism, 
such as bacteria, yeast. Insect and mammalian cells and viruses. The 
selection of a particular promoter depends on what cell type Is to be 
used to express the protein of interest Examples of the promoter 
Include, but are not limited to, £ co// lac and bp operons, the tac 

20 promoter, the bacteriophage X pf- promoter, bacteriophage 17 and SP6 
promoters, p-actin promoter, insulin promoter, human cytomegalovirus 
(CMV) promoter, HIV-LTR (HIV-long temninal repeat), Rous sarcoma 
virus RSV-LTR, simian virus SV40 promoter, baculovlral poiyhedrin and 
plO promoter. The promoter may also be an inducible promoter that 

25 regulates the expression of downstream gene in a controlbd manner, 
such as under a specific condition of the cell culture. Examples of 
Inducible promoters Include, but are not limited to, the bacterial dual 
promoter (actlvator/repressor expression system) which regulates gene 
expression in mammalian cells under the control of tetracyclines 

30 (Gossen, M. and Bujard. H. 1992, Proc.Natl. Acad. Sol. USA. 89. 5547- 
5551) and promoters that regulate gene expression under the control of 
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factors such as heat shocks, steroid hormones, heavy metals, phorbol 
ester, the adenovirus E1A element, interferon, or serum. 



The recombination site may be any segment or anays of DMA 
5 sequence recognized by site-specific recombinase which catalyzes site- 
specific fusion between the circular donor DMA and the acceptor vector. 
The site-specific recombinase may be any enzymes that recognize 
short DNA sequences that become the crossover regions during the 
recombination event, Including but not limited to recombinases, 
10 transposases and Integrases. 

Site-specific recombinases may derived from prokaryotic and 
eukaryotic sources. Examples of site-specific recombination include 1) 
chromosomal rean^ngements which occur in Salmonella typhimurium 

15 during phase variation, inversion of the FLP sequence during the 
replication of the yeast 2 \x circle and In the rearrangement of 
immunoglobulin and T cell receptor genes in vertebrates, 2) Integration 
of bacteriophages into the chromosome of prokaryotic host cells to form 
a lysogen and 3) transposition of mobile genetic elements (e.g., 

20 transposons) in both prokaryotes and eukaryotes. 

In one embodiment, the recombination site is a loxP site that is 
recognized by the Ore recombinase of bacteriophage PI. The Cre 
recombinase catalyzes recombination of DNA between two loxP sites. 
25 The loxP site consists of a double-stranded 34 bp sequence: 



5'-ATAACTTCGTATAATGTATQCTATACGAAGTTAT-3' 
3'-TATTGAAGCATATTACATACGATATGCTTCAATA-5' 



30 (SEQ ID N0:1} 
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The loxP site comprises two 13 bp inverted repeat sequences separated 
by an 8 bp spacer region, l-ioess et ai. (1982) Proc. Natl. Acad. Sd. 
USA 79:3398. Tlie internal spacer sequence of the loxP site is 
asynnmetrical and thus, two loxP sites can exhibit directionality relative 
5 to one another. Hoess et al. (1984) Proc. Natl. Acad. Sci. USA 81 : 1026. 
When two loxP sites on the same DNA molecule are in a directly 
repeated orientation, Cre excises the DNA between these two sites 
leaving a single ioxP site on the DNA molecule. Abremski et ai. (1983) 
Cell 32:1301. If two loxP sites are in opposite orientation on a single 
10 DNA molecule, Cre inverts the DNA sequence between these two sites 
rather than removing the sequence. 

The Cre recombinase also recognizes a number of variant or 
mutant lox sites relative to the loxP sequence. Examples of these Cre 
15 recombination sites include, but are not limited to, the loxB, loxL and 
loxR sites which are found in the E. coli chromosome. Hoess et al. 
(1982), supra. Other variant lox sites Include: 

loxPSli site: 5*-ATAACTTCGTATAGTATACATTATACGAAGTTAT-3' 
20 (SEQ ID N0:2); 

Hoess et al. (1986) Nucleic Acid Res. 14:2287-2300, 

loxC2 site: 5'-ACAAC TTCGTATAATGTATGCTATACGAAGTTAT-3' 
(SEQ ID N0:3) 
25 U.S. Pat. No. 4,959,317. 

Cre catalyzes the cleavage of the lox site within the spacer region 
and creates a six base-pair staggered cut. Hoess and Abremsid (1985) 
J. Mol. Biol. 181:351. The two 13 bp inverted repeat domains of the lox 
30 site represent binding sites for the Cre protein. If two lox sites differ in 
their spacer regions in such a manner that the overhanging ends of the 
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cleaved DMA cannot reanneal with one another. Cib may not efficiently 
catalyze a recombination event using the two different lox sites. For 
example, it has been reported that Cre could not efficiently recombine a 
loxP site and a loxPS1 1 site; these two lox sites differ in the spacer 
region. Two lox sites which differ due to variations in the binding sites 
(i.e., the 13 bp inverted repeats) may be recombined by Cre provided 
that Cre can bind to each of the variant binding sites; the efficiency of 
the reaction between two different lox sites (varying in the binding sites) 
may be less efficient that between two lox sites having the same 
sequence (the efficiency will depend on the degree and the location of 
the variations in the binding sites). For example, the loxC2 site can be 
efficiently recombined with the loxP site; these two lox sites differ by a 
single nucleotide in the left binding site. 

The Cre protein has been purified to homogeneity. Abremski et 
ai. (1984) J. Mol. Biol. 259:1509. And the cre gene has been cloned 
and expressed in a variety of host cells. Abremski et ai. (1983), supra. 
Purified Cre protein Is a\«liable from a number of suppliers (e.g., 
Novagen and New England Nuclear/Du Pont). 

The recombination site of the circular DMA may also be selected 
from a variety of other recombination sites recognized by recomblnases 
other than Cre. Examples of the non-Cre recomblnases include, but are 
not limited to, site-specific recomblnases include: the int recombinase 
of bacteriophage □, the FLP recombinase of the 2pi plasmid of 
Saocharanyces cere\nsiae, the resolvase family, transposase of 
Badllus thnifnglensis. 

The Int recombinase of bacteriophage X betongs to the Integrase 
family and mediates the Integration of the X genome into the £ coll 
chromosome. The Int recombinase of bacteriophage X promotes 
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irreversible recombination between its substrate att sites as part of the 
the fomiatlon or Induction of a lysogenic state. Landy, A., (1989) Ann. 
Rev. Blochem. 58:913. Reversibility of the recombination reactions 
results from two independent pathways for integrative and exclslve 
5 recombination. Each pathway uses a unique but overlapping set of the 
15 protein binding sites that comprise att site DMAs. Cooperative and 
competitive Interactions involving four proteins (Int. Xis, IHF and FIS) 
detemnlne the direction of recombination. Integrative recombination 
involves the Int and IHF proteins and sites attP (240 bp) and attB (25 

1 0 bp). Recombination results in the fbrnnation of two new sites: attL and 
attR. Excisive recombination requires Int, IHF, and Xis. and sites attL 
and attR to generate attP and aftB. Under certain conditions, FIS 
stimulates excisive recombination, in addition to these normal reactions, 
it should be appreciated that attP and attB, when placed on the same 

15 molecule, can promote excisive recombination to generate two excision 
products, one with attL and one with attR. Similarly, intennolecular 
recombination between molecules containing attL and attR, in the 
presence of Int, IHF and Xis, can result in integrative recombination and 
the generation attP and attB. Derivatives of the att site with changes 

20 within the 1 5 bp core may also be suitable for efficient recombination. 
By incorporating a native or modified att site in both the circular donor 
DNA and the acceptor vector, intermolecuiar recombination between the 
donor and acceptor DISIA molecules may be achieved by using the 
appropriate recombination protein such as Int, IHF and FIS, with or 

25 without Xis. Integrase can be obtained as described by Nash, H. A., 
(1983) Methods of Enzymology 100:210-216. IHF can be obtained as 
described by Fllutowicz, M., et al., (1994) Gene 147:149-150. 

. The other members of the integrase family of site-specific 
30 recombinases may also be used to provide alternative recombination 

proteins and recombination sites for the present invention. Examples of 
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such Int recombinases include, but not limited to, site-specific 
recomblnase encoded by bacteriophage X, phi 80, P22, P2, 186, P4. 
This group of recombinases exhibits a large diversity of sequences, but 
ail of the recombinases can be aligned in their C-tennlnal halves. A 40- 
5 residue region near the C tenninus Is particularly well conserved in ail 
the proteins and is homologous to a region near the C temiinus of the 
yeast 2^ plasmid Flp protein. Three positions are perfectly conserved 
within this family: histidine, arginine and tyrosine are fbund at respective 
alignment positions 396, 399 and 433 within the well-conserved C- 
10 terminal region. These residues contribute to the active site of this family 
of recombinases, and suggest that tyrosine-433 fonns a transient 
covalent linkage to DNA during strand cleavage and rejoining. Argos, P. 
et ai., (1986) EMBO J. 5:433-40. 

1 5 The FLP recomblnase of the 2pi plasmid of SaGoharomyces 

caraviaiae recognizes the frt site which, like the loxP site, comprises two 
13 bp inverted repeats separated by an 8 bp spacer 

5'-GAAGTTCCTATTCTCTAGAAAGT ATAGGAACTTC-3' 
20 (SEQ ID N0:4) 

Cox (1983) Proc. Natl. Acad. Scl. USA 80:4223. 

The FLP gene has been cloned and expressed in E. coil and In 
mammalian cells and has been purified. Meyer-Lean etal. (1987) 
25 Nucleic Acids Res. 15:6469; Babineau et al (1985) J. Biol. Chem. 
260:12313; Qronostajski and Sadowski (1985) J. Biol. Chem. 
260:12328. 

The resolvase family members, such as the Tn3 resolvase, the 
30 Hin recomblnase, and the Cin recomblnase, may also be used for 

recombination between the circular donor DNA and the circular acceptor 
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DNA. Although members of this highly related family of recomblnases 
are typically constrained to Intramolecular reactions (e.g., Inversions and 
excisions) and can require host-encoded factore. Mutants have been 
Isolated that relieve some of the requirements for host factors as well as 
6 some of the constraints of intramolecular recombination. Maeser and 
Kahnmann (1 991 ) Mol. Gen. Genet 230: 1 70-1 76. 



Transposase of Badllus thuringlensis may also be used as 
recombination proteins and recombination sites. Bacillus thuringlensis is 

10 an entomopathogenic bacterium whose toxicity is due to the presence in 
the sporangia of A-endotoxin crystals active against agricultural pests 
and vectors of human and animal diseases. Most of the genes coding 
for these toxin proteins are plasmid-borne and are generally structurally 
associated with insertion sequences (IS231, IS232, IS240, ISBT1 and 

15 ISBT2) and transposons (Tn4430 and Tn5401). Several of these mobile 
elements have been shown to be active and participate in the crysfal 
gene mobility, thereby contributing to the variation of bacterial toxicity. 
Structural analysis of the lso-IS231 elements indicates that they are 
related to IS1 1 51 from Clostridium perMngens and distantly related to 

20 184 and IS1 86 from E Goli. Like the other IS4 family ntembers, they 
contein a conserved transposase-integrase motif found in other IS 
families and retroviruses. Functional data gathered from 1S231A in E. 
{70// indicate a non-replicative mode of transposition, with a preference 
for specific targets. Similar results were also obtained in Bacillus subWs 

25 and B. thuringlensis. Mahlllon, J. et al., (1994) Genetica 93:13-26; 
(1992) Campbell, J. Bacterioi. 7495-7499. 

Other recombination systems may also be used as recombination 
proteins and recombination sites, Including the xerC and xerD 
30 recomblnases of £. coll which together fbnn a recombinase that 
recognizes the 28 bp dif site (Leslie and Shenratt (1995) EMBO J. 
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14:1561); the Int protein from the conjugatlve transposon Tn916 (Lu and 
Churchward (1994) EMBO J. 13:1541); Tpnl and the p-lactamase 
transposons (Levesque (1990) J. Bacteriol. 172:3745); and the 
immunoglobulin recombinases (i\/lalynn et al. (1988) Cell 54:453). 

5 

Other than the wild-type recombination sites, modified 
recombination sites may also be used in the present invention. Wild-type 
recombination sites may contain sequences that reduce the efficiency or 
specificity of recombination reactions. For example, multiple stop 

10 codons In attS, attR, attP, attL and loxP recombination sites occur in 
multiple reading frames on both strands, thereby reducing 
recombination efficiendes. For example att sites, such as att1, att2, and 
attS sites, may be modified to have one or multiple mutations to 
enhance specificity or efficiency of the recombination reaction and to 

15 decrease reverse reaction by removing Pland HI from attB. 

The circular donor DNA also contains one or more selectable 
markers to facilitate subsequent identification and selection of clones of 
the recombination product under suitable conditions. The selectable 

20 marker may encode any functional element, such as protein, peptide, 
RNA, binding site for RNA and proteins, or products that provide 
resistance to organic or inorganic agents. Examples of selectable 
markers include, but are not limited to, reporter genes such as p- 
galactosidase (GAL), fluorescent proteins (e.g., GFP, GFP-UV, EFFP, 

25 BFP, EBFP, ECFP, EYFP), secreted fonn of human placental alkaline 
phosphatase (SEAP), p-glucuronldase (GUS)); resistance genes that 
encodes products which provide resistance against other wise toxic 
agents such as antibiotics (e.g. neomycin (G418) or hygromycin 
resistant gene, puromycin resistant gene), yeast seletable markers Ieu2- 

30 d and URA3, apoptosis resistant genes (e.g. the baculovlral p35 gene) 
that encode proteins that binds to products which are detrimental to cell 
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survival and promote apoptosis; antlsenoligonucleoticles, and any other 
DNA that encodes product that directly or directly confer sensitivity of 
cells to particular agents. 

6 The circular donor DNA may optionally include an affinity tag for 

selection and isolation of protein product encoded by the donor DNA 
segment. Examples of such an affinity tag include, but are not limited 
to, a polyhistidlne tract, polyarginlne, giutathione-S-transfbrase (GST), 
maltose binding protein (MBP), a portion of staphylococcal protein A 

1 0 (SPA), and various ImmunoafRntty tags (e.g. protein A) and epitope tags 
such as those recognized by the EE (Glu-Glu) antipeptide antibodies. 
Th affinity tag may also be a signal peptide either native or heterologous 
to baculovims, such as honey bee mellitin signal peptide. The affinity 
tag may be positioned at either the amino- or carboxy-temninus of the 

15 donor DNA. 

2. Circular Acceptor Vector 

The present invention also provides a circular acceptor vector for 
20 generating recombinant expression vector. The vector comprises an 
origin of replication; and an acceptor recombination site capable of 
recombining vi^ith a donor DNA. Optionally, the acceptor vector may not 
include a promoter for regulating expression of the donor DNA. 

25 The circular acceptor vector may be any vector that can 

transfonn, transfect or transduce a host cell. The acceptor vector 
comprises a recombination site which is recognized by a site-specific 
recombinase and recombined with a donor DNA carrying another 
recombination site. The acceptor vector may be plasmids, phages or 

30 viral vectors as long as it is able to replicate in vitro, or in a host cell, or 
to convey the donor DNA to a desired location within a host cell. 
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Examples of host cells Include, but are not limited to, bacterial (e.g. E. 
coll, Bacillus subtllls, etc.). yeast, animal, plant, and insect cells. 

For plasmld-based expression vector, tfie recombination site may 
5 be Introduced into the vector by a double-stranded oligonucleotide 
containing the desired site-specific recombination site (e.g., a lox site). 
The double-stranded oligonucleotide may be formed by annealing two 
synthetic single-stranded ollgonuceotldes to form two ends which are 
compatible with ends of a linearized plasmid vector. The matching ends 
10 may be generated by restriction enzyme digestion or by using cloning 
kits such as the TA cloning kits available from Invitrogen, Inc. (San 
Diego, CA). 

The circular acceptor vector may be any prokaryotic plasmid that 
15 contains a recombination site (e.g. loxP site), a basic backbone of 

plasmid cloning vector such as pBR322, including one or more antibiotic 
resistant genes (e.g. Amp\ Tef) and an origin of replication that fijnction 
In specific host ceils. After recombination between the circular donor 
DNA and the acceptor vector to fbnn a fused plasmid, this plasmki 
20 vector can be used to transfbnn bacterial ceils. In the transfomied cell, 
a prokaryotic promoter, either carried by the donor DNA or the acceptor 
vector, causes expression of the donor DNA under suitable conditions. 

Optionally, the acceptor vector may comprise a prokaryotic 
25 tennination sequence. Examplesof the prokaryotic temnination 

sequence include, but are not limited to, the T7 termination sequence. A 
variety of temnlnation sequences are known to the art and may be 
employed in the nucleic add constructs of the present invention 
including, the Tjnt, Ty. T^, T^s, TRi, TRj, Tes temilnation signals derived 
30 from the bacteriophage □. Hendrix et al. Eds., Cold Spring Haridor 

Press, Cold Spring Harbor, N.Y. (1983) and temilnation signals derived 
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from bacterial genes such as the tip gene of E. coli. 

The circular acceptor vector may also be a mammalian 
expression vector that contains a recombination site (e.g. ioxP site), one 
6 or more eul<aryotic marker gene, the appropriate eukaryotic 

transcriptional and translational termination signals and a sequence that 
signals polyadenylatlon of the transcript messenger RNA (mRNA), and 
an orgin of replication that functions In a mammalian host cell. 
Examples of the eukaryotic polyadenylatlon sequence include, but are 
1 0 not limited to, the Herpes simplex virus thymkline kinase 

polyadenylatlon sequence, the bovine growth honnone polyadenylatlon 
sequence, and the simian virus 40 polyadenylatlon sequence. 

If the circular acceptor canies a promoter for regulating 
1 5 expression of the donor DNA sequence, the recombination site may be 
placed downstream the promoter and transcription initiation site in 
acceptor vector. This modification of the vector may be easily 
accomplished using synthetic ollgonucteotides comprising the desired 
recombination site (loxP site), in designing the oligonucleotide 
20 comprising the recombination site, it may be desirable to avoM 
introducing an ATG or start codon that might initiate translation 
Inappropriately, or in-frame stop codons. 

For expression vectors intended to generate a fusion protein 
25 between a protein domain located at the amino-tennlnus of the fusion 
protein and the protein encoded by the donor DNA, care may be taken 
to place the recombination site in the conect reading frame such that 1) 
an open reading frame is maintained through the recombination site on 
pHOST and 2) the reading frame in the recombination site on the 
30 acceptor vector is in frame with the reading frame found on the 
recombination site contained within the circular donor DNA. 
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Optionally, the eukaryotic expression vector may also canv an 
origin of replication and selectable marker genes that function in 
bacterial cells, forming a shuttle vector. After recombination between 
5 the circular donor DNA and the acceptor vector to form a fused 

expression vector, this vector can be used to transfect animal cells. In 
the transfected cell, a eukaryotic promoter, either canled by the donor 
DNA or the acceptor vector, causes expression of the donor DNA under 
suitable conditions. 

10 

The circular acceptor may also be a yeast expression vector 
such as a S. cerevisiae expression vector that includes a recombination 
site. Various types of S. cerevisiae expression vector include episomal 
or plasmid vector, integrating vectors, and yeast chromosomes (YACs). 
15 A YAC-based expression vector may be used to cany large segment of 
donor DNA, which is then maintained as a separate chromosome In the 
host yeast cell. 

The circular acceptor vector may also be a bacuiovirus DNA 
20 (genome) that is modified to contain a recombination site (e.g. a loxP 
site). For example, the baculovlral genome of Autographa califbmica 
multiple nuclear polyhedrosis virus (AcMNPV) may be modified to 
include a site-specific recombination site, such as a loxP site. A. 
califomica (the alfalfa loop) and over 30 other insect species can be 
25 infected by AcMNPV. This virus also grows well on many insect cell 
lines, such as Sf cell lines derived from the fall anrtywonn, Spodoptera 
frugiperda. In these cells, the the promoter of the viral protein, 
polyhedrin, is exceptionally strong when the virus infects the cell Such 
polyhedrin promoter can promote high level expression of downstream 
30 foreign gene that replaces the polyhedrin gene. 
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The recombination site (e.g. a loxP site) may be introduced Into a 
baculqviral vector through homologous recombination in vivo. For 
example, a baculoviral transfer vector may be used to shuttle the loxP 
site Into the wild type baculoviral DNA (e.g. AcMNPV) to fomi the final 
5 circular baculoviral vector. An example of such transfer vector is the 
p36C that contains baculoviral polyhedrin flanking sequences, up- and 
down-stream AcMNPV DNAs for homologous recombination. Page 
(1989) Nucleic Acids Res. 17: 454. The lopP site may be Introduced 
Into the transfer vector by regular oligonucleotide-medlated 

10 mutagenesis. A portion of the polyhedrin flanking sequence Is replaced 
with the loxP site using an oligonucleotide containing the loxP site. The 
the resulting loxP-contalning transfer vector and the wiM type baculoviral 
DNA are co-transfected into insect cells such as Sf9 ceils. Because the 
transfer vector contains the polyhedrin flanking sequences, a double 

15 crossover homologous recombination occurs in the cells, causing the 
replacement of the polyhedrin gene in AcMNPV DNA with the loxP site 
and therefore resulting in the integration of the loxP site Into the 
AcMNPV genome. After a desired period of time (e.g. 72 hr) the 
supernatant of the insect cell culture Is harvested and the progeny virus 

20 Is screened In a standard agarose overiay assay. Brown and Faulkner 
(1977) J. Gen. Virol. 36: 361-364. Polyhedrin-negative plaques are 
purifled to homogenity by successive rounds of agarose overlay assay. 
The presence of the ioxP site may be confimied by Southern analysis of 
DNA from the infected cells. 

25 

Optionally, the baculoviral acceptor vector according to the 
present Invention may not contain the polyhedrin promoter. Instead, the 
polyhedrin promote or a baculoviral plO promoter is positioned 
upstream of the donor DNA sequence of the circular donor DNA. After 
30 site-specific recombination between the circular donor DNA and a 

t>aculovlral acceptor vector, the resulting recombinant baculoviral vector 
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can be used to infect insect ceiis, such as Sf9 celis. 

Also optionally, the circular acceptor vector according to the 
present invention may include a negatively selectable marker gene, 
6 such as the selectable marker based on heipes simplex virus tk gene or 
any kind of gene encoding a protein that signals apoptosis and causes 
programmed cell death, such as the CAR1 gene. Expression of these 
negatively selectable marker genes resuKs In cell death, thereby 
eliminating those cells containing the circular acceptor vector that does 
1 0 not recombine vi/ith the circular donor DNA. 

The present invention also provides kits for generating 
recombinant vectors. In one embodiment, the kit comprises: a double- 
stranded circular donor DNA comprising a donor DNA sequence, a 
15 donor recombination site, and at least one selectable marker, the 

circular donor DNA not including an origin of replication; and a circular 
acceptor vector comprising an origin of replication and an acceptor 
recombination site capable of recombining with the circular donor DNA. 

20 In another embodiment the kit comprises: a library of double- 

stranded circular donor DNA comprising a donor DNA sequence virtiich 
varies within a library of donor DNA sequences, a donor recombination 
site, and at least one selectable marker, the circular donor DNA not 
including an origin of replication; and a circular acceptor vector 

25 comprising an origin of replication and an acceptor recombination site 
capable of recombining with the circular donor DNA. 

In yet another embodiment, the kit comprises: one or more linear 
donor DNA comprising a donor DNA sequence; a linear driver DNA 
30 comprising a promoter sequence, a recombination site, and at least one 
selectable marker, ligation of the linear donor DNA and the linear driver 
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DNA resulting in a circuiar donor DMA; and a circular acceptor vector 
comprising an origin of replication and an acceptor recombination site 
capable of recomblning with the circular donor DNA. 

5 

3. Method of Generating Recombinant Expression Vectors 
through Sits-Specific Recombination 

The present invention provides a method for generating 
1 0 recombinant expression vector in vitm through site-specific 

recombination between a circular donor DNA and drcular acceptor 
DNA, each containing recombination site recognized by the 
recombinase. The method comprises: contacting a circular double- 
stranded donor DNA and a circular acceptor vector in the presence of a 
IS recombinase under conditions suitable for the circular double-stranded 
donor DNA and circular acceptor vector to recombine to fbmi a single 
fused circular vector. 

In this method, the circular double-stranded donor DNA 
20 comprises a donor DNA sequence, a donor recombination site, and at 
least one selectable marker, but not including an origin of replication. 
The circular acceptor vector comprises an origin of replication and an 
acceptor recombination site capable of recombining with the circular 
donor DNA. The promoter for regulating expression of the donor DNA 
25 may be contained In either the donor DNA or acceptor vector. 

According to this method, the circuiar donor DNA containing a 
site-specific recombination site can be recombined with a circular 
acceptor vector in the presence of a site-specific recombinase, such as 
30 Cre recombinase. In vitro Cre recombinase can catalyze fusion of the 
donor DNA into the acceptor vector which contains another copy of the 
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recombination site (e.g. loxP site) found on the circular donor DMA. 
Optionally, the circular donor DNA may contain a promoter positioned 
upstream of the donor DNA sequence to regulate expression of the 
donor DNA once the recombinant vector is introduced into a host cell. 

5 

Alternatively, the promoter for regulating expression of the donor 
DNA sequence may be positioned upstream of the recombination site of 
the acceptor vector. Following the site-specific recombination between 
the recombination sites located on the donor DNA and acraptor vector, 
1 0 the two circular DNAs are stably fUsed in a manner that places the 
expression of the donor DNA sequence under the control of the 
promoter contained within the acceptor vector. The recombination 
occurs in a manner that retains the proper translational reading frame of 
the donor DNA. 

15 

Following the in vitro recombination, a portion of the reaction 
mixture may be used to trensfomn, transfect or transduce a suitable host . 
cell to pennit the recovery and propagation of the recombinant 
expression vectors. The correctly fused recombinant vector may be 

20 selected for its ability to trensfomn, trensfect or transduce a host cell and 
express the selectable marl<er that is contained in the circular donor 
DNA and recombined into the acceptor vector. The selectable 
phenotype conferred by the selectable marlcer gene on the recombinant 
vector may be change of color of the host cells upon proper chemical 

25 treatment, secretion of protein in the culture that is detectable by 

coiorometers or fluoremetere, survival of cells under selection pressure 
or enhanced propagation of a virus such as bacuiovims. The circular 
donor DNA cannot replicate in cells because it does not contain an 
origin of replication, and therefore, unless the circular donor DNA has 

30 integrated into the acceptor vector that contains an origin of expression, 
the circular donor DNA should not replicate in the host cell. The 
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recombinant expression vector may be isolated from host cells which 
display the desired phenotype and verified by using various methods 
such as restriction enzyme digestion, sequencing and Southem blotting. 

5 The present invention also provides a method for generating 

recombinant expression veclore from linear DMA segments In vitro. The 
method comprises: ligating one or more double-strended linear donor 
DNA which Includes a donor DMA sequence with a double-stranded 
linear driver DNA which includes a promoter sequence and a donor 

1 0 recombination site to fbrni a single circular donor DNA, the singular 
circular donor DNA not including an origin of replication, where the 
donor DNA sequence Is under the transcriptional control of the 
promoter; and contacting the circular donor DNA and a circular acceptor 
acceptor vector in the presence of a recombinase to fomi a single fused 

1 5 circular vector. In this method, the circular acceptor vector comprises 
an origin of replication and an acceptor recombination site capable of 
recomblning wKh the circular donor DNA. 

According to this method, the linear donor DNA and linear driver 
20 DNA may contain matching restriction sites or other type of annealing 

sites so as to be llgated to form a circulalzed DNA. After the ligation, the 
circularized donor DNA contains a site-specific recombination site 
carried by the linear driver DNA, Such a circular donor DNA is 
recombined with a circular acceptor vector in the presence of a site- 
25 specific recombinase, such as Cre recombinase. The Ore recombinase 
can catalyze fusion of the donor DNA into the acceptor vector which 
contains another copy of the recombination site (e.g. loxP site) found on 
the circular donor DNA. 

30 Figure 1 1llustrates a general scheme of this method according to 

the present Invention. As Illustrated In Figure 1, a library of linear donor 
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DMA (underlined) is directly ligated to a linear driver DNA (underiined) to 
generate a circular donor DNA. This allows for efficient cloning of donor 
Dl^ (e.g. cDNA library) into an expression vector in an automated and 
iiigh throughput manner. 

6 

The linear donor and driver DIMA can be generated by PGR 
amplification of a template Di^, either linear or contained in piasmids of 
cDNA clones. As illustrated in Figure 1, a library of cDNA clones from a 
desired source may be rean^yed into 86-well plates by using a re- 
10 an^ying robot. The library of cDNA clones may be cosmid and phage 
libraries that contain inserts primarily from single human chromosomes 
isolated by flow-sorting. These cDNA clones may be identified by their 
short sequence tags or expressed sequence tags (ESTs). These cDNA 
clones may be an^yed in multi-well mlcrotlter plates such as 96- or 384- 
15 well plates and handled by robots. For each re-anayed plate, data 
tracking system may be used to identify each clone, sequence 
accession number, passage number, eta 

The library of linear donor DNA may also be generated and 
amplified from total RNA or mRNA samples by using an RT-PCR 
method. The library of linear donor DNA can then be ligated with the 
linear driver DNA to form the circular donor DNA which is recombined 
with the circular acceptor. This allows direct transfemng of the cDf^ 
library (donor DNA) to the expression vector (acceptor vector) without 
going through a cloning step. 

Aitemativeiy, the library of linear donor DNA may be generated 
by random or site-directed mutagenesis of one or more target gene 
sequence. For example, "poisoned" PGR or Dl^ shuffling techniques 
may be used to generate a diverse library of donor DNA which can be 
incorporated into the circular donor DNA by using direct ligation 
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according to the present Invention. Clones of these mutagenized library 
can be screened for improved or unique blologlcai functions in an 
appropriate expression system. 

5 Bacterial cultures of the cDNA clones may be grown in the multi- 

well microtiter plates and directly used for PGR. Suitable PGR primers 
containing an annealing site may be used to amplify each cDNA done in 
the plate. For example, PGR primers may be designed to hybridize to 
the sequence of the cDNA clones and contain sequences for Ligation 
10 independent Cloning (LiC). High-fidelity thennostabie polymerases are 
preferably used to reduce copying errors and numbers of amplification 
cycles. The PGR product, the linear cDI^ library, may optionally be 
purified by passing through mini columns. 

15 Simllariy, the linear driver DNA may also be amplified by using 

prirners containing a matching annealing site as the primers for 
amplifying donor DNA. The template of the driver DISIA may also be a 
plasmid which contains ftinctionai elements, such as a site-specific 
recombination site, a promoter, a selectable maricer gene, a tag, and a 

20 transcription temnination signal. 

Still refenring to Figure 1, the linear driver DNA is annealed and 
ligated with the linear donor DNA under suitable conditions in multiple- 
well plates (e.g. in the presence of T4 ligase). For ligation independent 
25 cloning (LiC), the linear driver Dl^ and the linear donor DNA generated 
by LIC PGR amplification are digested with T4 polymerase, annealed, 
and ligated together to forni a library of circular donor DNAs 
(underiined). 

30 The circular donor DNA may be mixed with a circular acceptor 

vector (underlined) containing a suitable recombination site (e.g. loxP) in 
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the presence of a site-specific recombinase (e.g. Cre recombinase). For 
example, a circular baculoviral vector containing a ioxP site may be 
mixed with an an^y of circular cDNA (circular donor DNA) in the 
presence of purified Cre recombinase or cell extract containing Cre. 
5 The amount of recombinase which is added to drive the recombination 
reaction between the donor and vector DNAs can be detemiined by 
using a titration assay is used to detennine the appropriate amount of 
the purified recombinase enzyme or the appropriate amount of a Cre 
extract. The recombination reaction buffer compatible with purified Cre 

10 recombinase may contain 50 mM Tris-HCi (pH 7.5), 1 0 mM MgClg, 30 
mM NaCI and 1 mg/ml BSA. The concentration of the drcuiar donor 
DNA and the acceptor vector may vary between 10 ng to 10 ^g of each 
vector per 20 fil reaction volume. The recombination reaction may be 
incubated at 37°C for a necessary period of time and temiinated by 

1 5 heating at TO'C for 1 5 min. 

Site-specific recombination between the donor cDNA anay and 
the acceptor baculoviral vector in the presence of Cre recombinase 
results In a recombinant baculoviral vector (tiie vector DNA in Figure 1). 

20 The recombinant baculoviral vector may be used to transfect Sf9 cells 
(the host cell in Figure 1) an^yed in multi-well plates. The plates may 
be incubated at 27^ for several days. The progeny baculovirus may be 
screened in an overlay assay of the supernatant. For example, the 
presence of the integrated donor DNA in ttie viral progeny may be 

25 detemiined by detecting level of expression of the selectable mariner 

(e.g. p-GAL, GUS). Alternatively, a baculovirus apoptosis resistant gene 
(baculovims p35 gene) may be used as a selectable marker to confer 
increased viral yield in certain incest cell lines. Clem et al. (1991) 
Science 254: 1388-1390. Recombinant baculovirus bearing this gene 

30 has been shown to be amplified up to a million folds in appropriate host 
cells. Lerch et al (1993) Nucleic Acid Res. 21: 1753-1760. 
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The protein product expressed by the recombinant expression 
vector may be seletively identified and purified by using affinity tagging. 
The tag element may be a short DNA sequence inserted between the 
5 promoter and the transcription termination sequence in a proper reading 
frame on the recombinant expression vector. Such a tag element may 
encodes a short amino acid sequence that specifically binds to a 
compound or a macromoiecule. After transcription and translation of the 
recombinant expression vector in the host ceils the short amino acid 

1 0 sequence which is part of the donor DNA construct, acts as an 

identification tag (affinity tag), in another word, the tag canied by the 
circular donor DNA that Is recombined with the acceptor DNA Is 
expressed as a fusion protein encoded by the donor DNA. The 
presence of the fusion protein can be identified through the affinity 

1 5 binding of the tag to ite binding partner. 

For example, a polyhistidine tag (e.g. His^ may be used for 
isolating fusion proteins. The fusion protein carrying a His tag can be 
isolated by passing all proteins of a cell extract (e.g. Sf9 cells) through a 

20 column of Ni-triacetic add ararose beads. The unbound proteins are 
eluted from the column, and the bound fusion protein selectively 
removed by either adding a competitor compound (e.g. imidazole) that 
dislodges the His tag of the fusion protein fi-om the Ni ions or by lowing 
the pH of the elution buffer, if necessary, the tag may be removed form 

25 the fusion protein with a protease that cleaves only at the engineered 
site. • 

Altematively, the affinity tag may be a protein tag (e.g.. GST, 
maltose binding protein) or an epitope tag that are short amino acid 
30 sequences binding to glutathione, maltose and specific antibodies. For 
example, an antibody against an EE epitope tag may be used for 
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identHying and purifying fusion protein encoded by ttie donor DNA. Tlie 
EE (Glu-Qlu) antipeptlde antibody was raised against a peptide ttiat 
contains tlie major tyrosine pliosplioryiation site of polyoma middle T 
antigen. Talmage et al. (1989) Cell 59: 55-56. The antibody has 
5 moderate affinity for the EE tag (K^ ~ 2x1 0*^) which allows rapid elution 
of tagged fusion proteins by free peptide under non-denaturing 
conditions while retaining efficient binding of the fusion protein in caide 
lysates. Further, since the EE tag Is a strong tyrosine Idnase 
phosphorylation substrate f6r protein kinases (e.g. Src) it may serve as 
10 a detectation label fbr high throughput assays for protein interactions. 

The methods of the present invention allow rapid and efficient 
generation of expression vectors containing the gene of Interest without 
bacterial cloning. Various libaries of cDNA or genomic DNA that are 

15 difficult to be cloned in bacteria can be directly amplified and introduced 
Into any expression vector. The methods can be used In a wide variety 
of high through anrays for functional genomics, protein genomics and 
other human genome projects. For example, the methods may be used 
for systematic functional analysis gene profiling, gene tagging, gene 

20 overexpresslon, or systematic transcript analysis. The Infonnation 

generated can shed fight on functionally Important pathways In diseased 
cells In many important areas such as oncology and inflammation. 
Further, the methods may be used for high-throughput genetic screens 
for target discovery and validation, as well as drug discovery based on 

25 the targete discovered In the screens. 



Figure 2 Illustrates an example of how to constmct a baculoviral 
30 expression vector for the GUS gene according to the present Invention. 
As illustrated in Figure 1, a recombinant baculoviral expression vector Is 
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generated in vitro without a cloning step. The following is a detailed 
elucidation of the steps of using the method. 

1. Construction of the driver olasmld 

6 

A linear double-stranded driver DNA (underlined) is generated 
from a driver plasmid containing a bacuiovlral p10 promoter, a Neo 
marker gene, a loxP site and appropriate restrictions sites such as Sfi I 
and Not I sites. The Driver plasmid Is constmcted by using standard 

1 0 plasmid construction techniques as taught in Sambrook, et al. 

Moloojlar Cloning: A laboratory Manual; DNA Cloning: A Practical 
Approacf), Vol I & II (D. Glover ed.); Oligonucleotide Synthesis (N. Glat, 
ed.). The driver plasmid is based on pBR322, a standard bacterial 
cloning vector plasmid, and the following elements are cloned into 

15 pBR322 contiguously, in the order shown below: 

a) an Sfil restriction site (GGCCNNNNNGGCC) 

b) aloxPsite 
(ATAACTTCGTATAATGTATGCTATACGAAGTTAT) 
[SEQ ID NO: 1] 

20 c) a Neo gene with promoter and SV40 poly A addition 

site 

d) a baculovims plO promoter 

e) an EE-tag sequence (MEEEEYMPME) [SEQ ID NO: 5] 

f) a NotI restriction site (GCGGCCGC) 

25 

Sequence of the bacuiovlral plO promoter is described in Weyer 
U, Posses RD. (1989) J. Gen. Virol. 70:203-8 and is supplied by 
pAcAB4 from BD Phamningen, San Diego, CA. The sequence for ttie 
Neo gene vy^th a promoter and SV40 poly A addition site is available 
30 from the plasmid plE-neo supplied by Novagen Ina 
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2. Ampllflcatten and processing of tha donor oene DMA 

A linear double-stranded donor DMA (underlined) is amplified by 
PGR from a donor plasmid or a cDNA clone carrying the GUS gene. 
5 Oligonucleotide PGR primers that are homologous to the 5' and 3' ends 
of the GUS gene (source: pBacPAK8-GUS available from Clontech, 
Palo AKo, OA) are designed and synthesized. These primers also 
contain restriction sites (NotI for the 5' primer and Sfil for the 3' primer) 
in the 5' ends of the primers. The GUS gene is then amplified by PGR 
10 from the plasmid pBacPAK8-GUS containing the GUS gene using these 
primers. 

The PGR products fi'om GUS gene amplification are digested 
with NotI and Sfil restriction endonucleases to generate the appropriate 
15 sticky ends on the products, resulting in the linear double-stranded 
donor DNA. 

3. Construction of the circular donor DNA 

20 The driver plasmid described in Section 1 is digested with NotI 

and Sfil restriction endonucleases. The fragment which contains the 
elements listed above such as the loxP site and pi 0 promoter is isolated 
by gel electrophoresis purification, resulting in the linear double- 
stranded driver OUA. 

25 The linear driver DNA is annealed to an excess of the linear 

donor DNA and ligated by using T4 DNA iigase to produce the circular 
donor DNA (underiined). 

4. Constniction of the acceptor baculovirus Genomic DNA : 

30 A baculovirus transfer vector plasmid containing a loxP site is 

constructed according to the protocol described by Peakman et al. 



-39- 



wo 02/00875 



PCT/USOl/19770 



(1992) Nucleic Add Res. 20:495-500. Briefly, a double-stranded 
oligcnucieotlde consisting of a ioxP sequence [SEQ ID N0:1] with a 
blunt 5' end and a CTAG 5' overhang at the 3' end is synthesized. A 
baculovirus transfer vector pVL1392 (available from BD Phanningen, 
5 San Diego, CA ) is digested with EcoR V and BamH I restriction 

enzymes to remove the polyhedrin promoter in the vector. The double- 
stranded oligonucleotide with the loxP site is ligated with the EcoR 
V/BamH I digested piasmid pVL1392. The resulting ligated piasmid is 
transfonned into bacteria and screened for colonies with the loxP insert. 

10 This generates a baculoviral transfer vector containing a loxP site. 

The baculoviral transfer vector containing a loxP sKe is co- 
transfected with a linearized baculovirus DMA (BacPAK6 DMA from 
Clontech, Palo Alto, CA) by using standani baculovims constmction 
techniques (Methods in l\/lolecular Biology Vol. 39: Baculovims 

15 Expression Protocols, Christopher D. Ricardson ed., Humana Press, 
Totowa, NJ 1995). 

Recombinant baculovims that contains the loxP site is isolated by 
plaque purification and baculoviral DNA is prepared fiom the vims to 
produce the circular acceptor DMA. 

20 

5. Recombination of the circular donor DNA and the circular 
acceptor DNA and selection of the recombinant baculoviral DNA . 

The circular acceptor DNA and the circular donor DNA generated 
25 as described above is recombined in the presence of GST-Cre 

reoombinase (available from Invitrogen, San Diego. CA) by using the 
methods described by Liu et al. (1998) Cunent Biology 8:1300-1309. 
The mixture Is then heated at 65'C to Inactivate the Cre recomblnase. 

30 The circular acceptor DNA that has recombined with the circular 

donor DNA is selected and purified from the recombination reaction 
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mixture by using a method of "physical selection" described in Shejsard 
& Rae (1997) Nucleic Acid Research 25:3183. Briefly, a biotinylated 
synthetic oligonucleotide probe that is substantially homologous to any 
region present in the circular donor DNA is used to hybridize with the 
5 recombined baculoviral DNA. The hybridized complex is then separated 
by using avidin-coated magnetic beads via biotin-avidin high affinity 
binding. The circular donor DNA that is not recombined with the 
acceptor DNA may also be extracted fix>m the mixture. Since the 
circular donor DNA is incapable of r^ilcating in a host insect ceil and 
1 0 thus does not interfere with further functional analysis of the recombined 
baculoviral vector. The biotinylated oligonucleotide probe is preferably 
homologous to a GC-rich region of the circular donor DNA. 

6. Propagation and selection of the recombinant baculovirus 

15 

The recombinant bacuiovims DNA selected in the above- 
described process is transfected into Sf9 insect ceils using standanj 
baculovirus construction techniques (Methods in Molecular Biology Vol. 
39: Bacuiovims Expression Protocols, Christopher D. Ricanlson ed., 
20 Humana Press, Totowa, NJ 1995). 

The recombinant GUS baculovirus is passaged in the presence 
of G418 as described by Lerch and Friessen (1993) Nucleic Acid Res. 
21:1753-1760. This is a positive selection step for propagation of the 
25 recombinant GUS baculovirus containing the Neo gene as a selectable 
maricer. 

7. Assessment of recombinant prote in production 

30 The GUS activity is assayed by using the following protocol. The 

recombinant GUS baculovirus produced as described above is used to 
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infect Sf9 insect cells. After 2 days the Sf9 cells are lysed in a solution 
of: 20mM Tris pH8. 150 mM NaCI, 0.5% NP40 (10ml of solution per 
gram of cells). The resulting lysate Is serially diluted Into PBS plus 150 
ug/ml x-Gluc. Blue color in diluted samples indicates GUS activity. The 
5 tenninal dilution of the recombinant GUS baculovirus produced by using 
the method of the present invention is compared with that of a 
recombinant GUS baculovirus produced by using conventional 
recombination techniques. 

10 GUS protein expressed by the recombinant GUS baculovirus of 

the present invention is visualized by SDS gel electrophoresis and its 
levels of GUS expression are compared with those of GUS expression 
of a recombinant GUS baculovirus produced by using conventional 
recombination techniques. Briefly, Sf9 insect ceils are infected with the 

15 recombinant GUS baculovirus of the present invention. After 2 days, the 
infected ceils are lysed in a solution of: ZOrnM Tris pH8, 150 nM NaCi, 
0.5% NP40 (10ml of solution per gram of cells). The lysate Is loaded 
and run on an SDS-PAGE gel and stained with Coomassie blue. The 
intensity the 75Kd GUS band from ttie recombinant GUS baculovirus 

20 produced by using the method of the present invention is compared with 
that ftom a recombinant GUS baculovirus produced by using 
conventional recombination techniques. 

It will be apparent to tiiose skilled in the art that various 
25 modifications and variations can be made in the compounds, 

compositions, kits, and methods of the present invention wittiout 
departing from the spirit or scope of the invention. Thus, it is intended 
that Vne present invention covers the modifications and variations of tiiis 
invention provided may come wittiin the scope of the appended claims 
30 and tiieir equivalents. 
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CLAIMS 

What is claimed is: 

5 1 . A library of double-stranded circular donor DMA, comprising: 
a donor DIMA sequence which varies within a library of donor 
DMA sequences; 

a donor recombination site; and 
a selectable marker, 
1 0 wherein the circular donor ONA does not include an origin of 

replication. 

2. The library of circular donor DNA according to claim 1 , wherein 
the library of donor DNA is a library of cDNA or genomic DNA. 

15 

3. The library of circular donor DNA according to claim 2, wherein 
the cDNA library is derived from single human chromosomes. 

4. The iibraiy of circular donor DNA according to claim 1 , wherein 
20 the circular donor DNA further comprises a promoter sequence that 

controls expression of the donor DNA sequence. 

5. The library of circular donor DNA according to claim 4, wherein 
the promoter is derived from bacteria, yeast, insect, animal, plant or 

25 virus. 

6. The library of circular donor DNA according to claim 4, wherein 
the promoter is selected from the group consisting of £. coli lac and trp 
operons, the tec promoter, the bjacteriophage X promoter, 

30 bacteriophage T7 and SP6 promoters, p-actln promoter, insulin 

promoter, human cytomegalovirus (CMV) promoter, HIV-LTR, RSV-LTR, 
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SV40 promoter, baculovlral poiyhedrin and plO promoter. 

7. The library of circular donor DNA according to claim 4, wherein 
the promoter is an Inducible promoter. 

5 

8. The library of circular donor DNA according to claim 7, wherein 
the inducible promoter is selected from the group consisting of 
tetracycline, heat shock, steroid honmone, heavy metal, phorbol ester, 
adenovirus E1A element, Interferon, and serum inducible promoters. 

10 

9. The library of circular donor DNA according to dalm 1 , wherein 
the donor recombination site is a DNA sequence recognized by a site- 
specific recombinase to facilitate site-specific fusion between the circular 
donor DNA and an acceptor vector containing an acceptor 

15 recombination site. 

1 0. The library of circular donor DNA according to claim 1 , wherein 
the donor recombination site a recombination site recognized by a 
recombinase, a transposase or an integrase. 

20 

1 1 . The library of circular donor DNA according to dalm 1 , wherein 
the donor recombination site Is a lox site that is recognized by the Ore 
recombinase of bacteriophage PI. 

25 12. The library of circular donor DNA according to claim 1 1 , wherein 
the donor recombination site is selected from the group consisting of 
loxB, loxL, loxR, loxP,loxP3,loxP23,loxA86,loxAl17, loxPSII.and 
loxC2. 

30 1 3. The library of circular donor DNA according to dalm 1 1 , wherein 
the the donor recombination site is selected from the group consisting of 
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att Sites recognized by the Int recombinase of bacteriophage X, a FRT 
site recognized by FLP recombinase of the 2pi piasmid of 
Saccharomyoes ceravlslaa, a recombination site recognized by the 
resoivase family, and a recombination site recognized by transposase 
5 of Bacillus thruingiensis. 

14. The library of circular donor DNA according to claim 1 , wherein 
the selectable marker is selected fix>m the group consisting of 
galactosidase, fluorescent protein, secreted fomn of human placental 

10 alkaline phosphatase, p-giucuronidase, antibiotic resistance genes, 
yeast seletable markers Ieu2-d and URA3, apoptosis resistant genes, 
and antlsense oligonucleotides. 

1 5. The library of circular donor DNA according to claim 1 , wherein 
1 5 the circular donor DNA further includes an affinity tag. 

1 6. The library of circular donor DNA according to claim 1 5, wherein 
the affinity tag is selected from the group consisting of a poiyhistldine 
tract, polyarginine. glutathione-S- transferase, maltose binding protein, a 

20 portion of staphylococcal protein A, protein A, and epitope tag. 

1 7. The library of circular donor DNA according to claim 1 5, wherein 
the affinity tag is an EE tag. 

25 1 8. The library of circular donor DNA according to claim 1 , wherein 
the circular donor DNA further includes a polyadenylation signal. 

1 9. A double-stranded circular donor DNA, comprising: 
a donor DNA sequence; 
30 a donor recombination site; and 

at least one selectable marker, the circular donor DNA not 
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Induding an origin of replicatton. 

20. A circular acceptor vector, comprising: 
an origin of replication; and 

5 an acceptor recombination site capable of recombining with a 

donor DMA, the acceptor vector not including a promoter for regulating 
expression of the donor DMA. 

21 . The circular acceptor vector according to claim 20, wherein the 
1 0 circular acceptor vector is capable of transfonning, transfecting or 

transducing a host cell selected from the group consisting of a bacterial, 
yeast, animal, plant, and insect cell. 

22. The circular acceptor vector according to claim 20, wherein the 
1 5 circular acceptor vector is a prokaryotic plasmid. 

23. The circular acceptor vector according to daim 22 further 
comprising: a prokaryotic temfiinatlon sequence selected from the 
group consisting of the T7 tenfninatlon sequence, the bacteriophage k 

20 T,NT. Tu. Tia. Tia, TR^, TR2, and 7^ tenDlnatlon signals. 

24. The circular acceptor vector according to claim 19, wherein the 
circular acceptor vector is a mammalian expression vector. 

25 25. The circular acceptor vector according to claim 24 wherein the 
mammalian expression vector contains one or more eukaryotic maricer 
genes, a eukaryotic transcriptional and translational tennlnatlon signal 
and a polyadenylation signal. 

30 26. The circular acceptor vector according to dalm 24 wherein the 
mammalian expression vector includes an origin of replication and a 
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selectable marker gene that functions In bacterial cells. 

27. The circular acceptor vector according to dalm 1 9 wherein the 
circular acceptor vector is a yeast expression vector. 

6 

28. The circular acceptor vector according to claim 27 wherein the 
yeast expression vector is selected from the group consisting of 
episomal vector, plasmid vector, Integrating vector, and yeast 
chromosomes. 

10 

29. The circular acceptor vector according to claim 20 wherein the 
vector is a baculovlral vector. 

30. The circular acceptor vector according to claim 28 wherein the 

1 5 baculovlral vector js a modified Autogmpha califomica multiple nuclear 
polyhedrosis virus. 

3 1 . The circular acceptor vector according to claim 30 wherein the 
modified Autographa califomica multiple nuclear polyhedrosis virus does 

20 not include a polyhedrin promoter. 

32. A kit for generating a recombinant expression vector, comprising: 
one or more linear donor DNA comprising a donor DNA 

sequence; 

25 a linear driver DNA comprising a promoter sequence, a 

recombination site, and at least one selectable marker, iigatton of the 
linear donor DNA and the linear driver DNA resulting in a circular donor 
DNA; and 

a circular acceptor vector comprising an origin of replication and 
30 an acceptor recombination site capable of recomblning with the circular 
donor DNA. 
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33. A kit for generating a library of recombinant expression vectors, 
comprising: 

5 a library of double-stranded circular donor DNA comprising a 

donor Dl^ sequence whicli varies witiiin a library of donor DNA 
sequences, a donor recombination site, and at least one selectable 
marker, the circular donor DNA not including an origin of replication; and 
a circular acceptor vector comprising an origin of replication and 

1 0 an acceptor recombination site capable of recombining with the circular 
donor DNA. 

34. A method for generating recombinant expression vector, 
comprising: contacting a circular double-stranded donor DNA and a 

1 5 circular acceptor vector in the presence of a recombinase under 

conditions suitable for the circular double-stranded donor DNA and the 
circular acceptor vector to recombine to fomn a single fused circular 
vector, 

the circular double-stranded donor DNA comprising a 
20 donor DNA sequence, a donor recombination site, and at least one 
selectable marker, but not Including an origin of replication, and 

the circular acceptor vector comprising an origin of 
replication and an acceptor recombination site capable of recombining 
with the circular donor DNA. 

25 

35. The method according to dalm 34, wherein the circular donor 
DNA further Includes a promoter for regulating expression of the donor 
DNA. 

30 36. The method according to claim 34, wherein the recombination 

sites on the circular donor Dl^ and the circular acceptor vector are both 
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lox sites. 

37. The method according to daim 34 further comprising steps of 
transfomiing, transfecting or transducing a host cell; and 

5 selecting the correctly fused recombinant vector based on the 

selectable phenotype confenred by the selectable marker gene on the 
recombinant vector. 

38. The method according to claim 37 further comprising: 

10 selecting the conrectly fUsed recombinant vector by using a biotin- 

labeled oligonucleotide which is capable of hybridizing with circular 
double-stranded donor DI*4A. 

39. The method according to claim 34, wherein the recombinase is 
15 selected from the group consisting of the bacteriophage P1 Ore 

recombinase, yeast FLP recombinase, Inti integrase, bacteriophage X, 
phi 80, P22, P2, 186, and P4 recombinase, Tn3 resolvase, the Hin 
recombinase, the Cin recombinase, £ co// xerC and xerD 
recombinases, Badllus thuringlensls recombinase, Tpnl and the p- 
20 lactamase transposons, and the immunoglobulin recombinases. 

40. A method for generating recombinant expression vectors, 
comprising: ligating one or more double-stranded linear donor DNA 
which includes a donor DNA sequence with a double-stranded linear 

25 driver DNA which includes a promoter sequence and a donor 

recombination site to fonm a single circular donor DNA, the circular 
donor DNA not including an origin of replication, the donor DNA 
sequence being under the transcriptional control of tiie promoter and 
contacting the circular donor DNA and a drcular acceptor vector 

30 in the presence of a recornbinase to fbrni a single fused circular vector, 
the circular acceptor vector comprising an origin of replication and an 
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acceptor recombination site capable of recombining with tlie circular 
donor DNA. 

41 . The method according to claim 40, wherein the linear donor DNA 
5 and linear driver DNA contain matching restriction sites. 

42. A method for generating recombinant expression vectors, 
comprising: ligating a library of double-stranded linear donor DMAs, 
where each member of the library includes a donor DNA sequence, with 

10 a double-stranded linear driver DNA which includes a promoter 

sequence and a donor recombination site to fonm a single circular donor 
DNA, the single circular donor DNA not including an origin of replication, 
the donor DNA sequence being under the transcriptional control of the. 
promoter; and 

1 5 contacting the circular donor DNA and a circular acceptor vector 

in the presence of a recombinase to fomn a single fused circular vector, 
the circular acceptor vector comprising an origin of replication and an 
acceptor recombination site capable of recombining with the circular 
donor DNA. 

20 

43. The method acconding to claim 42, wherein the library of doubie- 
stranded linear donor DNAs are DNAs amplified from a library of cDNA 
clones. 

25 44. The method according to claim 43, wherein the library of cDNA 
clones is arrayed in a multi-well plate. 

45. The method according to claim 43, wherein the library of cDNA 
clones is a oosmid or phage library. 

30 

46. The method according to claim 42, wherein ligating the library of 
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double-Stranded linear donor DMAs with a double-stranded linear driver 
DNA is perfbnmed by ligation independent cloning. 
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SEQUENCE LISTING 

<110> Clark, Robin 

<120> COMPOSITIONS AND METHODS FOR GENERATING EXPRESSION VECTORS THROU 
GH SITE-SPECIFIC RECOMBINATION 

<130> 12636-239 

<150> US 09/606,323 
<151> 2000-06-28 

<160> 5 

<170> Patentln version 3.0 

<210> 1 

<211> 34 

<212> DNA 

<213> Artificial sequence: loxp 



<400> 1 

ataacttcgt ataatgtatg ctatacgaag ttat 
4 



<210> 2 

<211> 34 

<212> DNA 

<213> Artificial sequence: loxPSll 

<400> 2 

ataacttcgt atagtataca ttatacgaag ttat 
4 



3 



3 



<210> 3 

<211> 34 

<212> DNA 

<213> Artificial sequence: loxC2 

<400> 3 

acaacttcgt ataatgtatg ctatacgaag ttat 3 
4 



<210> 4 

<211> 34 

<212> DNA 

<213> Artificial sequence: FLP 

<400> 4 
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gaagttccta ttctctagaa agtataggaa cttc 
4 



<210> 5 

<211> io 

<212> PRT 

<213> Artificial sequence: EE-tag 

<400> 5 

Met Glu Glu Glu Glu Tyr Met Pro Met Glu 
1 5 . 10 
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